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SUMMARY 


The main proteinase (M°'°) of the severe acute respi- 
ratory syndrome (SARS) coronavirus is a principal 
target for the design of anticoronaviral compounds. 
Benzotriazole esters have been reported as potent 
nonpeptidic inhibitors of the enzyme, but their exact 
mechanism of action remains unclear. Here we pres- 
ent crystal structures of SARS-CoV MP", the active- 
site cysteine of which has been acylated by benzo- 
triazole esters that act as suicide inhibitors. In one 
of the structures, the thioester product has been 
hydrolyzed and benzoic acid is observed to bind to 
the hydrophobic S2 pocket. This structure also fea- 
tures the enzyme with a shortened N-terminal seg- 
ment (“amputated N finger”). The results further the 
understanding of the important role of the N finger 
for catalysis as well as the design of benzotriazole 
inhibitors with improved specificity. 


INTRODUCTION 


The global 2003 outbreak of the highly infectious severe acute 
respiratory syndrome (SARS) was caused by a new human coro- 
navirus, SARS coronavirus (GARS-CoV) (Ksiazek et al., 2003; 
Drosten et al., 2003; Peiris et al., 2003). Spreading from southern 
China, the disease was responsible for the death of about 10% of 
those infected (Stockman et al., 2006). Since 2003 there have 
been two isolated mini-outbreaks in China, but they have been 
contained in their early stages. These cases remind us of the 
lingering danger to human health represented by SARS-CoV 
as well as other coronaviruses, and the need for effective anti- 
coronaviral therapeutics. Since the global SARS outbreak, two 
new coronaviruses infecting humans have been discovered 
(van der Hoek et al., 2004; Fouchier et al., 2004; Woo et al., 
2005). HCoV NL63, a coronavirus associated with croup (Chiu 
et al., 2005), appears to be the most widespread member of 
the entire family. It is the causative agent of up to 10% of all re- 
spiratory diseases (Pyrc et al., 2006 and references therein). De- 
tected in 13 patients in Hong Kong, HCoV HKU1 is a coronavirus 
associated with relatively mild, community-acquired pneumonia 
characterized by fever, running nose, and cough (Lau et al., 
2006), although a subtype of HKU1 causing a severe obstructive 


pneumonia was recently identified (Kupfer et al., 2007). Two 
other human coronaviruses, HCoV 229E and OC43, have been 
known for decades, and are believed to cause mild upper respi- 
ratory tract illnesses, including the common cold (Mcintosh 
et al., 1967). Furthermore, HCoV OC43 has also been reported 
to have a high prevalence in the brains of multiple sclerosis 
patients (Arbour et al., 2000). 

Coronaviruses are enveloped, positive-sense RNA viruses. 
Their large single-stranded RNA genome ranges from 27 to 
31 kb (29.7 kb for SARS-CoV) and encodes several structural 
and auxiliary proteins as well as two large overlapping polypro- 
teins, pp1a (replicase 1a, ~486 kDa for SARS-CoV) and pptab 
(replicase 1ab, ~790 kDa) (Thiel et al., 2003). These polyproteins 
are subsequently cleaved to give rise to the individual nonstruc- 
tural proteins essential for building the viral replicase complex. 
In most coronaviruses, this maturation process is performed by 
extensive proteolytic processing by three cysteine peptidases. 
There are two papain-like proteinases (PL®'°) that are responsible 
for acting on three cleavage sites near the N terminus of the poly- 
proteins (Ziebuhr et al., 1995), and the main proteinase or MP'° 
(also called “3C-like protease” or 3CLP'°), which is necessary 
for cleaving at no less than 11 conserved sites containing a large 
hydrophobic residue (preferably Leu) in the P2 position, a Gln in 
the P1 position, and a small aliphatic amino acid residue (Ser, 
Gly, Ala) in the P1’ position (Ziebuhr et al., 2000; Hegyi and Zie- 
buhr, 2002; Anand et al., 2003). In contrast to other coronaviruses, 
the SARS-CoV genome encodes only one PL?” in addition to the 
MP'°. Besides being involved in the processing of the polypro- 
teins, the PLP’° has a second function as a deubiquitinating en- 
zyme (Barretto et al., 2005; Lindner et al., 2005). Because of its 
pivotal role in viral replication, the 33.8 kDa main proteinase has 
become the prime target in the development of inhibitors directed 
at SARS-CoV and other coronaviruses (Anand et al., 2003, 2005). 

So far, the crystal structures of the main proteinases of four 
coronaviruses have been determined: transmissible gastroen- 
teritis virus (TGEV) (Anand et al., 2002), HCoV 229E (Anand et al., 
2003), SARS-CoV (Yang et al., 2003; Tan et al., 2005; Xu et al., 
2005; Xue et al., 2007), and infectious bronchitis virus (IBV) 
(Xue et al., 2008). Except for IBV, the enzyme is a homodimer, 
and this is the quaternary structure observed in solution as well 
(Anand et al., 2002; Fan et al., 2004; Hsu et al., 2005a). In fact, 
it has been proposed that dimerization is essential for catalytic 
activity because the very amino terminus (‘N finger”) of one sub- 
unit is involved in organizing the substrate-binding pocket of the 
other (Anand et al., 2002; Yang et al., 2003; Tan et al., 2005). 
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Figure 1. Dimer of the SARS CoV MP'° 

Ribbon diagram and surface representation of the dimeric SARS-CoV MPr° 
(protomer A in blue, protomer B in light orange). The catalytic dyad residues 
(Cys145 and His41) in both monomers colored by atom (yellow, carbon; red, 
oxygen; blue, nitrogen; green, sulfur). The N finger (residues 1-7) is colored 
magenta for monomer A and green for monomer B. The very N terminus is 
marked by a sphere. It interacts with residue Glu166 of the opposing mono- 
mer. The oxyanion loop is colored red for monomer A and dark brown for 
monomer B. 


Each monomer of the coronaviral MP°'° consists of three 
domains. Domains | (residues 1-101) and II (residues 102-184) 
each consist of an antiparallel § barrel; taken together, domains 
| and Il resemble the structure of chymotrypsin-like serine pro- 
teinases (Anand et al., 2002, 2003; Yang et al., 2003; Tsukada 
and Blow, 1985). The active site of the MP'° is located in the 
interface between the two domains and consists of a Cys-His 
catalytic dyad, in which the cysteine acts as the nucleophile in 
the proteolytic cleavage reaction (Anand et al., 2002, 2003; 
Yang et al., 2003). Connected by a long loop (residues 185- 
200) to domain II, domain III is all-helical and not only involved, 
to some extent, in dimer formation but also in placing the N finger 
next to the substrate-binding site of the other monomer. In the 
active conformation of the protein, the individual binding pockets 
for the respective amino acid residues of the substrate are 
accessible and the oxyanion loop (residues 138-145) has the 
correct shape to donate two hydrogen bonds from main-chain 
amides to stabilize the tetrahedral transition state of the proteol- 
ysis reaction. The two monomers in the dimer are arranged 
perpendicular to each other (Figure 1; Yang et al., 2003). 

A number of crystal structures with small-molecule inhibitors 
bound to the coronavirus MP’ have been reported. Most of these 
compounds are peptide analogs derived from the substrate 
cleavage consensus sequence, such as peptidyl chloromethyl 
ketones (Anand et al., 2003; Yang et al., 2003), vinylogous alkyl 
ester derivatives serving as Michael acceptors (Yang et al., 
2005), aza-peptide epoxides (Lee et al., 2005, 2007), or «,B-ep- 
oxiketones (Goetz et al., 2007). Peptidyl aldehydes have been 
described as reversible inhibitors (Al-Gharabli et al., 2006; 


Chemistry & Biology 
SARS-CoV MP" Inactivated by Benzotriazole Esters 


Schmidt et al., 2008). Also, structure-based virtual screening 
approaches have led to the discovery of anumber of nonpeptidic 
inhibitors (Wu et al., 2004; Bacha et al., 2004; Chen et al., 2005b; 
Kaeppler et al., 2005; Tsai et al., 2006), but until now, only a few 
crystal structures of such complexes with the protease have 
been reported (e.g., Lu et al., 2006). 

In the early stages of the 2003 SARS outbreak, one of the first 
drugs tried for the treatment of patients was ribavirin, an antiviral 
drug commonly used against a number of DNA and RNA viruses 
(Sidwell et al., 1972), but this proved to be unsuccessful (Stock- 
man et al., 2006; Knowles et al., 2003). Left with few other 
options, physicians also tested HIV proteinase inhibitors (Stock- 
man et al., 2006), and Wu et al. (2006) found that one of them, 
lopinavir, targeted SARS-CoV MP” with an ICs9 of ~50 uM. In 
their search for derivatives of lopinavir with an improved inhibi- 
tory action toward MP’°, Wu et al. (2006) noticed by serendipity 
that the intermediate benzotriazole esters occurring during the 
synthesis of these derivatives were better inhibitors of the 
enzyme than the final products (see also Hilgenfeld and Pumpor, 
2006). These benzotriazole esters are the result of the activation 
of carboxylic acids by 2-(1H-benzotriazole-1 -yl)-1,1,3,3-tetra- 
methyluronium hexafluorophosphate (HBTU), which was used 
as a coupling reagent in the synthesis of the lopinavir-like mole- 
cules. Wu et al. (2006) reported K; = 7.5 nM and Kinact (=kK3) = 
0.0011 s~' for the best of these inhibitors and suggested, on 
the basis of mass spectrometric analyses, that the benzotriazole 
esters irreversibly acylate the active-site Cys145. However, even 
though these K; values are among the lowest for any inhibitor of 
the SARS-CoV MP'° described so far, Wu et al. (2006) also re- 
ported that full inhibition of the enzyme could not be achieved. 
Also, upon replacement of the ester oxygen by a methylene 
group, the inhibitors bound noncovalently to the enzyme, with 
K; values in the micromolar range (Wu et al., 2006). Such a mech- 
anism might also apply, to some extent, to the benzotriazole 
esters themselves, perhaps explaining the incomplete inhibition 
reported by Wu et al. (2006). Because of these uncertainties, we 
wanted to visualize the binding mode of benzotriazole esters to 
the SARS-CoV main proteinase, and have determined the crystal 
structures of the enzyme after incubation with two such 
compounds, 1-(benzoyloxy)-benzotriazole (compound XP-27) 
and 1-(4-dimethylaminobenzoyloxy)-benzotriazole (compound 
XP-59) (See Figure 2). 


RESULTS AND DISCUSSION 


Chain Termini of the SARS-CoV M°’? Influence 

Catalytic Activity 

Two different gene constructs coding for the SARS-CoV main 
proteinase were used in this study. One of them ([ASer1]/ 
C-Hisg) has the last two residues at the C terminus of the protein 
(Phe305 and Gin306) replaced by a hexahistidine tag, while the 
codon for the N-terminal serine has been replaced by the start 
codon, AUG. As we can demonstrate by Edman degradation 
(not shown), the corresponding formylmethionine residue was 
cleaved off during synthesis in Escherichia coli, most probably 
by methionyl-aminopeptidase (Hirel et al., 1989), creating 
A(Ser1)/C-Hisg MP'°. The other construct (‘authentic MP”) car- 
ries an N-terminal GST-AVLQ tag that is autocleaved by the pro- 
tease itself, and a C-terminal GP-H, tag which can be removed 
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N, x N x X. Figure 2. Chemical Formulas and Reaction 
CL ‘N ‘N ‘ag Mechanism 
N N S\7Cys Formulas for compounds XP-27 (X = H) and XP-59 
oO ———>+> yO —> oO (X = [CHs]oN) and proposed reaction mechanism 
Cy (Wu et al., 2006). 
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by PreScission protease, so that the final proteinase product 
possesses the authentic N and C termini (Xue et al., 2007). The 
difference in enzymatic activity and dimer stability between 
the two constructs illustrates the importance of an intact 
N terminus (N finger). The kcat/K of the (ASer1)/C-Hisg protein, 
determined by using an HPLC-based assay with the pentadeca- 
peptide SWTSAVLQ|SGFRKWA as a substrate, was 61.9 
(£1.5) mM~'min~" (Tan et al., 2005). Using a FRET-based assay 
with a fluorogenic peptide substrate (Dabcyl-KTSAVLQ | 
SGFRKME-Edans), this value was 167.9 mM ‘min’. The Kp 
value for the monomer-dimer equilibrium of this enzyme species 
was estimated at =250 nM by using the method of Kuo et al. 
(2004), much lower than the values derived for a similar construct 
by Graziano et al. (2006) from small-angle scattering and chem- 
ical crosslinking. Accordingly, the fraction of catalytically active 
(ASer1)/C-Hisg dimers at the enzyme concentration used in the 
assays (1.0 uM with respect to monomers) is about 0.8, and 
the kca/Ky of this enzyme species has to be corrected to 
=209 mM 'min~'. The MP? with the authentic chain termini, 
however, is significantly more active under the same circum- 
stances and has a Kcat/Ky value of 502.3 mM~'min~'. For the 
latter enzyme, the Kp value for dimer dissociation was estimated 
to be significantly lower than 100 nM, in agreement with most 
reports in the literature (Kuo et al., 2004; Hsu et al., 2005b). 
This would indicate that > > 90% of the authentic M°"° molecules 
exist as dimers at the concentration of 1.0 uM (with respect to 
monomers) used in the assays. Occasionally, much higher Kp 
values have been reported for SARS-CoV MP’? dimer dissocia- 
tion in the literature (e.g., Fan et al., 2004; Chen et al., 2005a), 
but in most of these cases, the enzyme did not have authentic 
chain termini. 

For the authentic enzyme, we determined Ky = 2.2 mM and 
Keat = 1105 min~', whereas for (ASer1)/C-Hisg, the respective 
values were 0.24 mM and 40.3 min“! (prior to correction for 
dimer dissociation; see above). Thus, the authentic enzyme, 
although displaying a 10-fold larger Kj, value, exhibited a 27- 
fold higher activity than (ASer1)/C-Hisg. The crystal structures 
presented below provide an explanation for this observation. 


Kinetic Data for the Benzotriazole Inhibitors 

We found the benzotriazole derivatives to be potent inhibitors of 
SARS-CoV MP'°, capable of inhibiting half of the authentic 
enzyme activity at low micromolar concentrations (<5 uM and 
0.1 uM for XP-27 and XP-59, respectively), as determined using 
the fluorogenic peptide cleavage assay. Under the assumption 
that the inhibitors act as suicide substrates by covalently binding 
to the proteinase, a description of the compounds requires 
determination of both the equilibrium binding constant K; and 
the inactivation rate constant (for covalent bond formation) 


kg (=Kinact), according to the following scheme (Scheme I, Equa- 
tion VIII.127 of Dixon and Webb, 1979): 


Ki kg 
E SEI> EH 
Ks \t 
ES 


In competition with the fluorescent peptide substrate (S), the 
inhibitor initially forms a Michaelis complex with the proteinase, 
which subsequently undergoes a nucleophilic attack by 
Cys145 onto the ester bond, leading to the formation of a cova- 
lent enzyme-inhibitor product (see Figure 2 for chemical formu- 
las and reaction mechanism). The leaving group of the reaction 
is 1-hydroxybenzotriazole. For the inhibitor XP-59, the K; and 
ks values were determined as 1.38 1M and 0.013 s~', respec- 
tively. At an inhibitor concentration of 1.3 uM, approximately 
97% inhibition of the enzyme was achieved. For the inhibitor 
XP-27, the exact determination of the inhibition constant proved 
difficult because of the inherent instability of this compound. UV 
absorption of XP-27 decreased in anhydrous dimethyl sulfoxide 
(DMSO) over time, even in the absence of enzyme, and this pro- 
cess was significantly enhanced by addition of a few microliters 
of water. Addition of XP-27 to the enzyme led to rapid inactiva- 
tion, but after about 3-4 min, the enzyme activity started to re- 
cover (Figure 3A). A possible reason for this behavior is that 
the rather instable thioester product may have been partially 
hydrolyzed. As we shall see below, our crystal structure of the 
enzyme acylated by XP-27 provides an explanation for this ob- 
servation. In contrast to XP-27 and its thioester product with 
the enzyme, XP-59 and its covalent MP?" adduct were more sta- 
ble, because the electron-donating nature of the p-dimethyla- 
mino group decreased the electrophilicity of the carbonyl C 
atom. The inhibition curve (fluorescence versus time) of XP-59 
displayed the expected hyperbolic shape (Figure 3B). 

Wu et al. (2006) reported a K; value of 17.4 nM and a kg of 
0.0013 s~' for XP-59 (their compound 4). We have no immediate 
explanation for the discrepancy by a factor of 80 between the kK; 
value reported by these authors and our findings, nor for the fac- 
tor of 10 between their and our k3 values. We also do not know 
whether the keat/Ky value of their enzyme preparation is compa- 
rable to ours. 


Binding of 1-(Benzoyloxy)-Benzotriazole (Compound 
XP-27) to SARS-CoV MP'° (Complex 1) 

Complex 1 was obtained by soaking crystals of (ASer1/C-Hisg)- 
SARS-CoV MP” at pH 6.5 with 5 mM 1-(benzoyloxy)-benzotria- 
zole (XP-27). The plate-like crystals displayed space group P2, 
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Figure 3. Inhibition Kinetics of Benzotria- 
zole Esters 

Inhibition kinetics of benzotriazole esters XP-27 (A) 
and XP-59 (B). The graphs show the fluorescence 
(in arbitrary units) created by substrate cleavage 
as a function of time. For inhibitor XP-27, the curve 
initially reaches a plateau, but after about 3-4 min, 
enzymatic activity seems to be restored (A). 
For XP-59, the expected hyperbolic curve is 
observed (B). 


time [min] 


(Table 1) and contained one MP’? dimer in the asymmetric unit. 
One monomer of the dimer (protomer A) was found to be in the 
active conformation, whereas the other exhibits an inactive con- 
formation in which the loop forming the oxyanion hole (residues 
138-145) has a catalytically incompetent shape. This has previ- 
ously been observed in other SARS-CoV MP" structures when 
the crystals had been grown at pH 6.0-6.5, and we have postu- 
lated that the inactive form in the dimer arises when a histidine 
residue (His163) at the bottom of the S1 specificity pocket is pro- 
tonated at low pH, leading to a reorientation of Glu166 and to 
the collapse of both the S1 pocket and the oxyanion hole 
(Yang et al., 2003; Tan et al., 2005; Anand et al., 2005; see, for 
instance, Protein Data Bank [PDB] ID code: 1UJ1). 


Table 1. Statistics of Data Collection and Processing, and 
Crystallographic Refinement 


XP-27 XP-59 

Data Collection 
Space group P2, C2 
Unit cell dimensions 

a (A) 52.23 108.27 

b (A) 97.76 81.97 

c (A) 67.71 53.42 

B(°) 103.01 104.3 
Wavelength (A) 0.8157 0.8080 
Resolution range (A) 40-2.25 65-1.95 

Outer shell (A) 2.34-2.25 2.02-1.95 
Unique reflections 31,491 30,270 
Mean redundancy* 3.2 (3.2) 3.4 (3.4) 
Completeness (%)* 99.7 (100.0) 97.4 (96.8) 
Rimerge (%)*” 7.3 (46.6) 12.3 (44.5) 
ol) 12.9 (1.91) 12.8 (2.49) 
Refinement 
R factor (%)° 18.30 16.48 
Free R factor (%)° 25.59 21.01 
Root-mean-square deviations from ideality 

Bonds (A) 0.016 0.016 

Angles (°) 1.57 1.64 


“Values in parentheses are for the highest resolution shell. 

e Rmerge = > snki>ill — <1 >| Doi! X 100%, where / and </> are the /th 
observed intensity and average intensity of the reflection hk/, respec- 
tively. 

°R factor = SY nxil |Fobs| _ \Featcl|/S-nilFobs| x 100%. 

“Free R factor = Tnx < 1l|Fobs| — |Featell/Y nxt < TlFobs| X 100%, where 
hkI < T represents the test data set of 5% of the diffraction data. 


time [min] 


When examining the electron density for monomer B, extra 
density was seen connected to the active-site Cys145 into which 
we could model a covalently bound benzoyl ester, with an occu- 
pancy of 70% (Figure 4A). The benzene ring of the inhibitor lies 
like a lid on top of the entrance to the S1 pocket and pushes 
away the Glu166 side chain, which is originally (30% occupancy 
remaining for this conformation) blocking the pocket. The O«2 
atom of the reoriented Glu166 now makes a hydrogen bond to 
the Ne2 atom of His172 (2.54 A), just like in the active conforma- 
tion (Tan et al., 2005). Presumably, the highly reactive and rela- 
tively small inhibitor induces this conformation upon binding, 
but its steric demands on the oxyanion loop are too limited to 
force this segment into the active conformation. The benzene 
ring of the covalently bound inhibitor makes van der Waals 
contacts with the rim of the collapsed oxyanion loop (Ca atoms 
of Asn142 and Gly143). 

Surprisingly, the immediate active site of monomer A is empty 
and contains no electron density for a covalently bound product. 
However, in the hydrophobic S2 specificity pocket, we found 
clear difference density (>40) for a benzoic acid molecule. The 
molecule is sandwiched between the side chains of Met49 and 
Met165. The latter adopts two conformations, one of which (oc- 
cupancy 50%) is not compatible with the presence of benzoic 
acid at this site. The occupancy of the benzoic acid molecule 
was fixed at 30%. The observation of this molecule in the S2 
site immediately raises the question as to which mechanism 
was at work here. We assume that the thioester formed between 
the benzoyl group and Cys145 (with 1-hydroxybenzotriazole be- 
ing the leaving group) is attacked by one of the numerous water 
molecules in the substrate-binding site. This results in the pro- 
duction of benzoic acid and restoration of the free active-site 
cysteine. Because of its overall hydrophobicity, the benzoic 
acid then binds to the nearby S2 pocket (Figure 4B). This inter- 
pretation is in full agreement with the observed biphasic kinetics 
for XP-27 (Figure 3A). However, even though benzoic acid itself 
is not an inhibitor of the enzyme up to uM concentrations (data 
not shown), we cannot exclude that the compound bound to 
the S2 pocket might originate from degradation of free XP-27 
in solution over the time of the crystal-soaking experiment. In 
any case, our findings help explain the observation of Wu et al. 
(2006) that their benzotriazole inhibitors, although displaying 
nanomolar K; values, did not lead to complete inhibition of the 
enzyme. 

If hydrolysis of the enzyme-bound thioester occurs with 
XP-27, why then only in the A monomer? Only in this molecule 
is the oxyanion loop in the correct conformation to stabilize the 
tetrahedral intermediate of the hydrolysis reaction. In the B 
monomer, this loop is in the catalytically incompetent 
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Figure 4. Active-Site Environment of the 
SARS-CoV MP'° Reacted with 1-(Benzoy- 
loxy)-Benzotriazole 

Active-site environment of the SARS-CoV MPr° 
reacted with 1-(benzoyloxy)-benzotriazole (XP- 
27), with corresponding 2F, — F, electron density 
map (contoured at 10 above the mean). 

(A) Monomer B with Cys145 acylated by the 
1-(benzoyloxy) moiety (70% occupancy; atom 
colors), which covers the S1 specificity pocket. 
An F, — Fe omit map (green), contoured at 2.750 
above the mean, is shown for the inhibitor moiety. 
Glu166 (red) has a double conformation, one of 
which exists in the 30% of the molecules that do 
not have the active-site cysteine acylated. Met49 


and Met165 (double conformation) (orange) line the S2 specificity pocket. His163 and His172 are colored in magenta. The catalytic dyad residues (Cys145 
and His41) are colored by atom (yellow, carbon; red, oxygen; blue, nitrogen; green, sulfur). Loop 138-145 is in an inactive conformation (Phe140 turned away 


from His163) and colored gray. 


(B) Monomer A represents the structure after hydrolysis of the thioester. The resulting benzoic acid molecule (atom colors; F, — F, omit map, contoured at 2.750, 
shown in green) has entered the S2 pocket and is sandwiched between Met49 and Met165 (orange). The latter has two conformations, one of which exists only in 
the 50% of the molecules that do not have the benzoic acid bound. The oxyanion loop (gray) is in an active conformation, with Phe140 stacking against His163 
(magenta). The 2F, — F, electron density maps (blue) are contoured at 10 above the mean. 


conformation and, therefore, the thioester adduct remains rela- 
tively stable (even though the occupancy is only 70%). 


Amputation of the N Finger in (ASer1)/C-Hisg M°"° Leads 
to Local Structural Changes 

Apart from some small differences, mostly in flexible loops at the 
periphery of the protein and at the C terminus, the dimeric struc- 
tures seen in the enzyme modified by XP-27 and the original 
1UJ1 dimer (Yang et al., 2003) overlay quite well. The overall 
root-mean-square deviation for 577 out of 600 Ca atoms is 
0.36 A. Interestingly, we found electron density for three 
DMSO molecules, which must originate from the DMSO used 
to dissolve the inhibitor in the crystal-soaking experiment. One 
of them fills a hole at the site where the N fingers of the two poly- 
peptide chains approach each other, near the carbonyl groups of 
the Lys5 residues of both the A and the B monomer, that is, more 
or less on the noncrystallographic two-fold axis of symmetry 
(near the position where the magenta and green polypeptide 
segments meet in Figure 1). This site could be of interest for 
designing dimerization inhibitors. The two remaining DMSO 
molecules fill the space between the MP'° dimers adjacent to 
the crystal contacts, next to the AspB244...His#A134 (DMSO2) 
and the GInA256...Trp#A31/Ala#A70 (DMSO3) contacts (# marks 
a residue in a symmetry-related dimer). 

Because of the cloning procedure used in the present study, 
the N-terminal serine has been replaced by a formylmethionine 
which, favored by the penultimate residue being a glycine, was 
cleaved off during synthesis in E. coli (Hirel et al., 1989). This 
causes the “amputated N finger” (residues 2-7, ASer1) to be- 
come more flexible. As a result, residues Gly2 and Phe? in 
both monomers A and B have flipped their orientation such 
that the N terminus of Gly2 is now, at least in protomer B, hydro- 
gen bonded to the Sy atom of Cys300 (domain III) of the parent 
monomer (at 2.39 Ain protomer B, 4.50 Ain protomer A) and the 
N finger can no longer stabilize the loop (residues 138-145) form- 
ing the oxyanion hole of the other monomer. This is observed in 
monomer B, where the oxyanion loop is in a collapsed conforma- 
tion as discussed before. Yet, in monomer A, this loop is in the 


catalytically competent form although the main-chain conforma- 
tion of Phe140 is no longer stabilized by the N finger of monomer 
B. But the phenyl ring of this residue is still embedded in a hydro- 
phobic pocket consisting of Val114, Alai16, Tyr126, Tyr161, 
His163, and His172. The lack of an intact N finger, however, 
does make the loop 138-145 more flexible. This is reflected in 
the B factors of the main-chain atoms of the loop in monomer 
A (44.4-74.0 A?, with an average of 64.3 A), which are higher 
than the average B factor of the entire main chain (40.4 A?). 
This situation is comparable to the B factors observed in the 
loop 138-145 of the B monomer (47.1-77.8 A?, with an average 
of 67.9 A2: the average B factor for the entire main chain is 
41.7 A2). The high flexibility of the oxyanion-binding loop explains 
the fact that k,t for the enzyme with the amputated N finger is 
significantly lower (40.3 min~') than for the authentic MP 
(1105 min~'). On the other hand, deletion of the N-terminal serine 
may create more space in the substrate-binding site, hence the 
lower Ky for the (ASer1)/C-Hisg enzyme. 

We have shown in this work that deletion of Ser1, Phe305, and 
GlIn306, plus the C-terminal attachment of a hexahistidine tag 
(not seen in the electron density maps), reduces enzymatic activ- 
ity (Kcat/Km) by = 59%. In another study, Hsu et al. (2005a) have 
shown that a deletion of the three N-terminal amino acid residues 
reduces the enzyme’s activity by a mere 24%, whereas the 
fourth residue, arginine, is absolutely essential for catalytic activ- 
ity and important for the formation of the dimer. In agreement 
with this, we previously reported almost total inactivation of the 
homologous main proteinase from TGEV upon removal of five 
residues from the N terminus (Anand et al., 2002). In another re- 
port, removal of the seven N-terminal residues abolishes the 
enzymatic activity of the main proteinase almost completely, 
but does not seem to have much influence on dimer formation 
(Chen et al., 2005a) (however, the enzyme used in this study 
had both N- and C-terminal extensions). It has been suggested 
by molecular dynamics simulations that a different dimer could 
form under these conditions, which involves much more elabo- 
rate contact between domains III of the monomers (Chen et al., 
2005a). This would be in agreement with the observed 
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Figure 5. Active-Site Environment of the 
SARS-CoV MP"° Reacted with 1-(4-Dimethy- 
laminobenzoyloxy)-Benzotriazole 

Active-site environment of the SARS-CoV MP" re- 
acted with 1-(4-dimethylaminobenzoyloxy)-ben- 
zotriazole (XP-59), with corresponding 2F, — F, 
electron density map (contoured at 10 above the 
mean). 

(A) Cys145 is acylated by the 1-(4-dimethylamino- 
benzoyloxy) moiety (atom colors; color code as in 
Figure 4), which extends toward the S2 specificity 
pocket. An F, — F, omit map (green), contoured at 
2.756 above the mean, is overlaid onto the inhibi- 
tor moiety. Met49 and Met165 (orange) line the S2 
specificity pocket. His163 and His172 are colored 
in magenta. The catalytic dyad residues (Cys145 
and His41) are colored by atom. Loop 138-145 is 


in an active conformation and colored gray. The 2F, — F, electron density map is contoured at 10 above the mean. 
(B) Comparison of the active site of the free enzyme (light orange) with the 1-(4-dimethylamino)-benzoyl thioester (gray). 2F, — F, electron density (contoured 
at 10 above the mean) is shown for the thioester and His41. The latter has rotated in comparison to the free enzyme and is now stacking against the aromatic 


moiety of the inhibitor. 


dimerization of isolated domain III of SARS-CoV MP’? (Shi et al., 
2004). Although the N-terminal residues obviously have an effect 
on the stability of an intact substrate-binding site, the catalytic 
activity of the M°'° is probably mainly controlled by the proton- 
ation states of the two histidine residues, His163 and His172, 
in the substrate-binding site. Molecular dynamics calculations 
performed on the MP dimer have emphasized the role played 
by these two histidines in maintaining the correct conformation 
of the substrate-binding site (Tan et al., 2005). 


Binding of 1-(4-Dimethylaminobenzoyloxy)- 
Benzotriazole (XP-59) to SARS-CoV M°"° (Complex 2) 
Complex 2 was obtained by cocrystallization. SARS-CoV MPr° 
(with authentic chain termini) was incubated for 2 hr at 25°C 
with a 7-fold excess of 1-(4-dimethylaminobenzoyloxy)-benzo- 
triazole (compound XP-59), and crystal growth was initiated by 
microseeding using crushed crystals of the free enzyme. Small 
block-sized (0.2 mm x 0.1 mm x 0.1 mm) crystals appeared 
overnight and displayed monoclinic space group C2 (Table 1), 
isomorphous to the crystals of the free enzyme (Xue et al., 
2007). These crystals, grown at pH 6.0, have one monomer in 
the asymmetric unit, which has the substrate-binding site in the 
active conformation. Formed by the crystallographic two-fold 
axis from the monomer, the dimer is very similar to the one in 
the P2, crystal structures, except for the oxyanion loops of 
both monomers being in the active conformation. The largest 
differences occur between the C termini of the monomers and 
between the oxyanion loops of monomer B, which is in the col- 
lapsed conformation in the P2, crystal structure. The C terminus 
in the C2 crystal structure is completely visible and extends in a 
direction different from the apparently flexible C terminus in the 
P2, crystal structure. The root-mean-square deviation for 584 
out of 603 Ca atoms with the dimer observed in space group 
P2, (Yang et al., 2003; PDB ID code: 1UJ1) is 0.83 A, whereas 
this value is only 0.23 A (297 out of 306 Ca pairs) for the compar- 
ison with the enzyme with authentic chain termini that was 
crystallized in space group C2 (Xue et al., 2007). 

Upon inspection of the active site of the M?"°-inhibitor com- 
plex, clear electron density was observed for the 1-(4-dimethyla- 


mino)-benzoyl moiety covalently bound to Cys145 (Figure 5A). 
Surprisingly, the side chain of the other catalytic dyad residue 
His41 has undergone a rotation away from Cys145 (Figure 5B). 
It is now stacked against the phenyl ring of the inhibitor, making 
good 7-7 interactions (distance between the planes is 3.4 A). 
The S1-binding pocket is not occupied by the inhibitor. Instead, 
a chain of three water molecules with well-defined density con- 
nects the His163 Ne2 atom at the bottom of the S1 specificity 
site with the side chains of Glu166 (Os1) and Asn142 (061). 
The dimethylamino moiety of the inhibitor is situated be- 
tween the side chains of Met49 and Met165, where it partly 
occupies the S2 specificity pocket. It is also involved in van 
der Waals interactions with the main-chain atoms of residues 
Asp187, Arg188, Gln189, and Pro39. A similar binding mode of 
the inhibitor to the main proteinase is observed when C2 crystals 
of the free enzyme are soaked at pH 7.0 with a 20-fold excess of 
compound XP-59 (data not shown). Binding of a substituted 
phenyl group to the S2 specificity subsite has also been 
observed in other studies (e.g., Goetz et al., 2007). 

In the crystal structure of complex 2, the loop 138-145 is being 
held in the active conformation by the N finger of the symmetry- 
related molecule (distance SerB1 N—PheA140 O: 2.67 A) in order 
to create the perfect environment for the oxyanion transition 
state. The conformation of this loop is quite stable, as can be de- 
duced from the B factors of the main-chain atoms of the loop 
compared to the overall B factor of the entire main chain (ranging 
from 24 to 34 A? with an average of 28.6 A?, compared to 
31.5 A’). Equally, the amino terminus of the N finger is also hydro- 
gen bonded to Glu166 O¢1 (at 2.75 A), preventing closure of the 
S1 specificity pocket by the Glu166 side chain. This catalytically 
competent conformation explains the high Kgat (1105 min™') ob- 
served for the authentic enzyme, when compared to the situation 
in the enzyme featuring the amputated N finger ([ASer1]/C-Hisg; 
Keat = 40.3 min’). 

Comparison of the crystal structures of the two MP"°-inhibitor 
complexes shows that the covalently bound suicide-inhibitor 
products are oriented in different directions (cf. Figures 4A and 
5A). The benzoyloxy moiety in complex 1 (molecule B) is covering 
the S1 specificity pocket (Figure 4A), whereas the dimethylamino 
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group in complex 2 is entering the S2-binding site (Figure 5A). In 
an attempt to explain how the inhibition can proceed, Wu et al. 
(2006) have modeled a number of their benzotriazole inhibitors, 
among them XP-59, into the active site of the SARS-CoV MP". 
In their model of the Michaelis complex, the benzotriazole is 
positioned in the oxyanion hole environment and the remainder 
of the inhibitor is located in the vicinity of amino acid residues 
Thr25, Thr26, His41, Thr45, Ala46, and Met49, leaving the car- 
bonyl group of the benzotriazole ester close enough to the Sy 
of the active-site Cys145 for a nucleophilic attack to occur. 
This orientation of the inhibitor is possible, but then the resulting 
covalently bound products have to swing at least 90° around the 
covalent bond to occupy an energetically probably more favor- 
able niche on top of the S1-binding site or in the S2 pocket in 
the two crystal structures, respectively. Because of the rather 
small size of the products of suicide inhibition, this should be 
possible without serious steric clashes. That structural rear- 
rangements must have happened in the enzyme after the reac- 
tion with the inhibitor is clear in the crystal structure of MPr° 
complexed with the dimethylaminobenzoyloxy moiety (from 
compound XP-59) where, first, the oxygen atom proposed to 
be stabilized in the oxyanion hole (formed by the main-chain 
amides of Gly143 and Cys145) during the transition state of the 
reaction is flipped away and makes no specific contacts any- 
more and, second, the side chain of the active-site residue 
His41 has moved away from its original position to make the 
stacking interactions with the inhibitor product as discussed 
previously (Figure 5B). 


SIGNIFICANCE 


Caused by a newly identified coronavirus, SARS-CoV, se- 
vere acute respiratory syndrome (SARS) is an emerging dis- 
ease that has the capacity to become a global threat to world 
health because of its rapid transmission. The animal reser- 
voir for the virus (most probably bats) is still existent and a 
renewed zoonotic transition to man cannot be excluded. 
Therefore, efficient drugs are needed in order to be prepared 
for a possible future outbreak. The SARS-CoV main protein- 
ase (MP'°) plays a central role in the formation of the viral 
replicase/transcriptase complex and is thus an ideal target 
for the development of suitable drugs. 

The structures of complexes of the M’™ resulting from 
reaction with two aromatic benzotriazole esters provide 
a promising starting point for designing more specific inhib- 
itors for the proteinase. Because of their high reactivity, the 
inhibitors presented here might not be very selective, al- 
though it is encouraging to note that in the entire main 
proteinase with its 12 cysteine residues, only the active-site 
Cys145 is acylated. Furthermore, Wu et al. (2006) reported 
that these compounds are not cell toxic, at least not up to 
a concentration of 100 .M. Starting from these lead struc- 
tures, the inhibitors can now be modified by substituents oc- 
cupying the S1 and S2 specificity pockets in order to improve 
their potency and specificity. The structures presented here 
also demonstrate that electron-donating substituents in the 
benzoyl moiety are useful to stabilize the thioester bonds 
with the active-site cysteine against hydrolysis. Finally, the 


pro 


structure of the M?'° featuring an amputated N finger (resi- 
due Ser1 deleted) explains the reduced catalytic activity of 
this species. 


EXPERIMENTAL PROCEDURES 


Expression and Purification 
The C-terminally Hisg-tagged SARS-CoV MP" was cloned and overexpressed 
in E. coli BL21 (DE3) cells. Plasmid pET-SCoV-Mpro-A305-306(6x His) was 
kindly provided by Dr. J. Ziebuhr. In this plasmid, the nucleotides coding for 
the C-terminal residues Phe305 and GIn306 have been replaced by a hexahis- 
tidine tail, whereas the codon for the N-terminal Ser1 was replaced by a start 
codon. Most likely, the resulting N-formylmethionine has posttranslationally 
been cleaved off by the methionine-aminopeptidase during expression in 
E. coli (Hirel et al., 1989). Edman degradation showed the polypeptide chain 
to begin with residue Gly2. The overexpressed protein was purified by 
nickel-NTA affinity chromatography followed by a gel-filtration step. Only the 
middle fractions of the resulting elution peak were pooled and used for crystal- 
lization experiments. 

The plasmid coding for SARS-CoV MP? with authentic N and C termini was 
a gift from Prof. Z. Rao (Xue et al., 2007). The plasmid was transformed in E. coli 
BL21 (DE3) cells as an N-terminal GST fusion protein with a C-terminal tag of 
eight extra residues containing a glycine, a proline, and six histidines (GPH,). 
Because of the construction of the plasmid, the GST tag was cleaved off by 
autocatalytic processing of the active MP°. The C-terminally GPH,-tagged 
product was subjected to nickel-NTA affinity chromatography and concen- 
trated in PreScission cleavage buffer (Walker et al., 1994). The PreScission 
protease cleaved off the C-terminal GPH, tag, producing an authentic 
SARS-CoV MP'°. The wild-type protein was further purified using anion- 
exchange chromatography (Xue et al., 2007). 


Preparation of Benzotriazole Esters 

To a stirred solution of 2.5 mmol N-hydroxybenzotriazole and 2.75 mmol 
triethylamine in 4 ml dichloromethane (DCM) at room temperature, 2.5 mmol 
benzoyl chloride was slowly added. The reaction mixture was stirred for 
2 hr, diluted with DCM (20 ml), washed with saturated NaHCOg solution (2 x 
4 ml) and brine (1 x 5 ml), dried over MgSQu,, and filtered. The solvent was 
removed under reduced pressure and the residue was purified to provide 
the desired benzotriazole. 

1-(Benzoyloxy)-Benzotriazole (XP-27) 

After the reaction mixture had been worked up, the crude product was purified 
by column chromatography using diethyl ether:cyclohexane (2:3). 1-(benzoy- 
loxy)-benzotriazole was obtained in 91% yield as a white solid. Melting point 
74°C; ms (ESI) m/z = 239.0 [M]*, 105.0 [CgH;CO]*. 
1-(4-Dimethylaminobenzoyloxy)-Benzotriazole (XP-59) 

After the reaction mixture had been worked up, the crude product was recrys- 
tallized from hexane:ethyl acetate. 1-(4-dimethylaminobenzoyloxy)-benzotria- 
zole was obtained in 80% yield as a slightly yellow solid. Melting point 153°C; 
ms (ESI) m/z = 869.3 [3M + Na]*, 587.2 [2M + Na]*, 305.1 [M + Na]*. 


Enzyme Kinetics 

Both a fluorescence-based and an HPLC assay were used to assess the activ- 
ity of the SARS-CoV MP’. The former made use of the fluorogenic peptide sub- 
strate Dabcyl-KTSAVLQ | SGFRKME-Edans (95% purity; Biosyntan GmbH, 
Berlin, Germany), which contains the main proteinase cleavage site (indicated 
by the arrow). The enhanced fluorescence due to the cleavage of this substrate 
as catalyzed by the enzyme was monitored at 490 nm with excitation at 
340 nm, using a Cary Eclipse fluorescence spectrophotometer. The experi- 
ments were performed in a buffer consisting of 20 mM Tris-HCl (pH 7.3), 
100 mM NaCl, 1 mM EDTA. Kinetic parameters Ky and kca: were determined 
by initial-rate measurements at 25°C. The reaction was initiated by adding 
proteinase (final concentration 0.5 1M) to a solution containing different final 
concentrations of the fluorogenic peptide (10-40 1M). The initial rates were 
converted to enzyme activities (1umol substrate cleaved per s). Kinetic con- 
stants were derived by fitting the data to the Michaelis-Menten equation 
with the nonlinear regression analysis program SigmaPlot (SigmaPlot 2000, 
version 6.00; SPSS, Chicago, IL, USA). 
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The HPLC-based approach has been described elsewhere (Tan et al., 2005). 
Briefly, the substrate used here was the pentadecapeptide SWTSAVLQ| 
SGFRKWA, which also resembles the cleavage site at the N terminus of the 
SARS-CoV MP"°, except for the two tryptophans at either side of the cleavage 
site replacing the P7 lle and the P6’ Met residues. The 20 ul reaction mixture con- 
tained the inhibitor at concentrations varying from 5 to 500 uM, 1.5 4M MPre 
dimer (diluted in 100 mM morpholinoethanesulfonic acid monohydride [MES] 
[PH 7.0]), and 250 1M substrate. Inhibitor and MP’? were incubated for 5 min at 
room temperature before adding the substrate. The reaction was run for 5 min at 
25°C, then stopped by adding 5 ul trifluoroacetic acid (0.1%) and put on ice or 
stored at —20°C. The samples were centrifuged and 80 yl MES buffer (100 mM) 
was added. Products and substrate were separated on a reverse-phase HPLC 
column (Jupiter 441 Proteo 90A, Phenomenex, Torrance, CA, USA), using a gra- 
dient of acetonitrile in 0.1% trifluoroacetic acid. The product and substrate 
peaks were detected at 280 nm, and ICso values were determined. 


Inhibition Assay 

The fluorescence-based assay was also used for the determination of the 
inhibitor constants of the two benzotriazole compounds. These tests were 
performed with the untagged, authentic SARS-CoV MP". Values of K; and kg 
(see scheme above) were calculated from plots of 1/k,ps versus 1/[I] according 
to Equation 1 (Equation VIII.129 of Dixon and Webb, 1979): 


121~/K. [S]\ 1 
Kons Re Ks (+72) iy vi 


To obtain the observed first-order inhibition rate constant Kops, time-depen- 
dent progress curves were fitted to a first-order exponential (Equation 2; equiv- 
alent to Equation A10 of Tian and Tsou, 1982, with the addition of D; see 
below): 


F= ( vo ) “(1 — exp Met) 4D. (2) 
Kops 


F is the product fluorescence (measured in arbitrary units), Vo is the initial 
velocity, t is time, and D is a displacement term to account for the nonzero 
emission at the start of data collection. 

In the experiment, the Kj and kg values for the irreversible inhibitors were ob- 
tained from reactions initiated by addition of the enzyme (final concentration 
0.5 uM) to 20 mM Tris-HCl buffer (pH 7.3) containing 100 mM NaCl, 1 mM 
EDTA, 20 uM fluorogenic substrate, and inhibitor. Four different inhibitor 
concentrations were tested, in at least 10-fold molar excess over the enzyme 
in most cases. Data from the continuous assays were analyzed with the 
nonlinear regression analysis program SigmaPlot. 


Crystallization of the Complexes 

Before crystallization, the C-terminally His-tagged SARS-CoV MP"° ([ASer1]/ 
C-Hisg) was dialyzed against 50 mM Tris-CH3;COOH (pH 7.5), 1 mM EDTA, 
5 mM DTT and concentrated to 10 mg/ml. The protein was crystallized at 
18°C by vapor diffusion using hanging drops or sitting drops. The crystalliza- 
tion conditions were 5% PEG 10,000, 0.1 M MES (pH 6.5), 3% ethylene glycol, 
50 mM ammonium acetate (Lee et al., 2005). Plate-shaped crystals of mono- 
clinic form appeared after 3 days to a week, sometimes accompanied by 
needles and irregularly shaped crystals, which turned out to belong to the te- 
tragonal space group (see Tan et al., 2005). One of the monoclinic crystals 
was soaked for 4 days at 18°C in the crystallization buffer solution containing 
5 mM 1-(benzoyloxy)-benzotriazole (XP-27) and 10% DMSO to increase the 
solubility of the inhibitor. 

The untagged SARS-CoV MP"? with authentic chain termini, concentrated to 
10 mg/ml in 50 mM Tris-HCl (pH 7.5), 40 mM NaCl, 1 mM EDTA, 5 mM DTT, 
was incubated for 2 hr at 25°C with a 7-fold excess of the inhibitor 1-(4-dime- 
thylaminobenzoyloxy)-benzotriazole (XP-59) and solubilized in 8% PEG 6000, 
0.1 M MES (pH 6.0), 3% 2-methyl-2,4-pentanediol (MPD). Block-shaped crys- 
tals grew overnight at 20°C in 6%-8% PEG 6000, 0.1 M MES (pH 6.0), 3% 
MPD, after initiation of nucleation by microseeding using crushed C2 crystals 
of the free enzyme. 

Prior to data collection, crystals obtained from both procedures were trans- 
ferred for a couple of seconds to a cryoprotectant solution containing the 
crystallization conditions and 30% PEG 400. 
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Crystallographic Data Collection and Processing, and Structure 
Elucidation and Refinement 

Statistics of data collection, processing, and refinement are summarized in 
Table 1. Diffraction data were collected at 100K using monochromatic syn- 
chrotron radiation provided by beamlines X11 and X13 (EMBL and Universities 
of Hamburg and Ltibeck, DESY, Hamburg, Germany; wavelength 0.8157 and 
0.8080 A, respectively). Intensities were measured using a MAR CCD detector. 
Indexing, scaling, and merging of data sets were performed using DENZO and 
SCALEPACK (Otwinowski and Minor, 1997). Molecular replacement and 
refinement were carried out using MOLREP (Vagin and Teplyakov, 1997) 
and REFMAC (Murshudov et al., 1997), respectively, as implemented in the 
CCP4 suite (CCP4, 1994). Simple bulk scaling in REFMAC was preferred 
over Babinet scaling because this procedure better reflected the overall Wilson 
B factor. The search model used for molecular replacement was either the 
dimer or monomer A of the crystal structure of PDB ID code 1UJ1 for the 
data sets of SARS-CoV MP’? complexed with XP-27 or XP-59, respectively. 
The computer graphics program Coot, implemented in the CCP4 suite, was 
used for interpretation of the electron density maps and model building (Ems- 
ley and Cowtan, 2004). The molecular graphics package PyMOL was used to 
generate the figures (DeLano, 2002). 


ACCESSION NUMBERS 


The atomic coordinates and structure factors, respectively, have been depos- 
ited in the RCSB Protein Data Bank under ID codes 2VJ1 and r2vjisf (for 
Complex 1), and 2V6N and r2v6nsf (for Complex 2). 
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